Multivariate Statistical Methods for Analyzing Genetic Association Studies

نویسنده

  • Thorsten DICKHAUS
چکیده

Genetic association studies lead to simultaneous categorical data analysis. The sample for every genetic locus consists of a contingency table containing the numbers of observed genotype-phenotype combinations. The goal of the statistical analysis is to detect associations between the (potentially very large) set of genetic markers and the (typically binary) phenotype of interest. This is a particular multiple test problem which has several challenging aspects, for instance the high dimensionality of the statistical parameter and the discreteness of the statistical model. Furthermore, the locus-specific contingency tables exhibit strong dependencies, at least in blocks of loci which are in linkage disequilibrium (LD), due to the biological mechanism of inheritance. This makes a multivariate statistical analysis the method of choice. In the first part of the presentation, we will consider frequentist multiple test procedures which are based on the concept of the effective number of tests based on probability bounds, see [1,2] and Section 4.3 of [3]. Such procedures incorporate LD information in a relaxed multiplicity correction of Bonferronior Šidák-type. Due to the extended interpretation of LD provided in [4], this methodology is applicable for a variety of families of test statistics. The second part is based on [5] and deals with Bayesian approaches to contingency table inference for genetic association data. Here, the multiplicity correction is performed via an appropriate construction of the prior probabilities for the validity of the locus-specific null hypotheses of no association. Exploiting the conjugacy of Dirichlet and multinomial distributions, posterior probabilities for the nulls can exactly be computed for any finite sample size, and decision theoretic multiple test procedures can be applied.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Methods for Analyzing Multivariate Phenotypes in Genetic Association Studies.

This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Multivariate phenotypes are frequently encountered in genetic association studies. The purpose of analyzing multivariate phenotypes usually includes discovery of novel genetic varian...

متن کامل

metaCCA: summary statistics-based multivariate meta-analysis of genome-wide association studies using canonical correlation analysis

MOTIVATION A dominant approach to genetic association studies is to perform univariate tests between genotype-phenotype pairs. However, analyzing related traits together increases statistical power, and certain complex associations become detectable only when several variants are tested jointly. Currently, modest sample sizes of individual cohorts, and restricted availability of individual-leve...

متن کامل

Genetics and population analysis metaCCA: summary statistics-based multivariate meta-analysis of genome-wide association studies using canonical correlation analysis

Motivation: A dominant approach to genetic association studies is to perform univariate tests between genotype-phenotype pairs. However, analyzing related traits together increases statistical power, and certain complex associations become detectable only when several variants are tested jointly. Currently, modest sample sizes of individual cohorts, and restricted availability of individual-lev...

متن کامل

Genetic Variation of Seed Related Traits in Festuca arundinacea Using Multivariate Statistical Methods

Genetic diversity is the basis of breeding studies in many plant species and is one of the most important indicators for selecting parents. The aim of this experiment was to investigate the genetic diversity of tall fescue (Festuca arundinacea) using agronomic traits such as plant height, spring growth score, days to flowering, days to pollination, flag leaf length and width, panicle length, we...

متن کامل

Comparison of several sequence-based association methods in pedigrees

Genome-wide association studies are very powerful in determining the genetic variants affecting complex diseases. Most of the available methods are very useful in detecting association between common variants and complex diseases. Recently, methods to detect rare variants in association with complex diseases have been developed with the increasingly available sequencing data from next-generatio...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015